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Abstract 

For linear and Gaussian state space models parametrized by Go £ G C 
R'",r > 1 corresponding to the vector of parameters of the model, the 
Kalman filter gives exactly the solution for the optimal filtering under 
weak assumptions. This result supposes that 9o is perfectly known. In 
most real applications, this assumption is not realistic since 80 is unknown 
and has to be estimated. In this paper, we analysis the Kalman filter for 
a biased estimator 9 of do- We show the propagation of this bias on the 
estimation of the hidden state. We give an expression of this propagation 
for linear and Gaussian state space models and we extend this result for 
almost linear models estimated by the Extended Kalman filter. An illus- 
tration is given for the autoregressive process with measurement noises 
widely studied in econometrics to model economic and financial data. 

Keywords: Kalman filter, Extended Kalman filter, State space mod- 
els, Autoregressive process 



1 Introduction 

Let (CI, T, ¥g ) be a probability space parametrized by 9$ € C R r , r > 1 cor- 
responding to the vector of parameters of the model. We define two real vectors 
{xt,t G N} defined on (CI, T, ¥g ) with value in X and {yt,t S N*} denned on 
(Cl, J",Pe a ) with value in y. The process {x t ,t e N} (respectively {y t ,t £ N*}) 
is called the unobserved signal process (resp. the observation process). 

The Kalman filter (KF) and the Extended Kalman filter (EKF) commonly 
used in some engineering applications have been successfully employed in various 
areas. These filters may be easily understood by reading the first publication of 
Kalman in 1960 [Kal60] or the Bayesian interpretation of Harrison and Stevens 
in 1971 [HS71]. 
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1.1 The (Extended) Kalman filter: motivation 

Let y±, ■ ■ ■ ,ut be the data (which may be either a scalar or a vector) at time 
1, • ■ ■ , T. We assume that yt depends on the unobservable variable Xt- The aim 
of the (Extended) Kalman filter is to make inference about the hidden state Xt 
(which may also be a scalar or a vector) conditionally to the data yi, • • ■ ,yt- The 
relationship between the observed variable yt and the hidden state Xt is linear 
and described by a function h depending on the unknown vector of parameters 
9q. This relation is specified by the following observation equation: 

y t =h(6 ,x t ) +of £t, t>l 

where Et is the vector of noises assumed to be normally distributed with mean 
zero and unit variance, denoted as: St ~ A/"(0, I Uy xrly ) where n y is the dimension 
of the observation space y. 

The hidden state Xt is assumed to be varying with time and its dynamic feature 
is given by the following state equation: 

x t = b(0 ,x t -i) + a^rjt 

where b is a known function and % is the state error assumed to be normally 
distributed with mean zero and unit variance, i.e r] t ~ jV(0, / %x „ t ) where n x 
is the dimension of the state space X . 

In addition to the usual Kalman filter assumptions (see [Kal60]), we also assume 
that the noises et and rj t are independent. 

Hence, this paper is concerned with the following discrete time state space model 
with additives noises: 

f Vt = h(9 ,x t ) +of o et, *>1 /j\ 
\ x t = b(6 , x t -i) + a^it 

Under the usual Kalman assumptions, the model (1) can be rewritten as 
follows: 

f y t = d t (9 Q ) + C 6o x t + a% Q e t t > 1, 
1 x t = u t (6 ) + A 6o xt-i + <Jg Vt, 

If the vector of parameters #o is perfectly known, the optimal filtering pe (xt\yv.t) 
is Gaussian and the Kalman filter gives exactly the two first conditional mo- 
ments: x t = Eg [x t \yi:t] and P t = Eg [(x t —x t )(xt -xt)'\yi:t] where ' stands for 
the transpose. In particular, the Kalman filter estimator is the BLUE (Best Lin- 
ear and Unbiased Estimator) among linear estimators. Nevertheless, in most 
applications the linearity assumption of the functions h and b is not always 
satisifed. A linearization by a one order Taylor series expansion can be per- 
formed and the Extended Kalman filter consists in applying the Kalman filter 
on this linearized model. 

For the EKF, the matrix Cg is the differential of the function h with respect to 
(w.r.t.) x computed at the point (6o,x^) where x^~ corresponds to the condi- 
tional expectation Ee [xt|yi : t_i]. Additionally, the matrix Ag is the differential 
of the function b w.r.t. x computed at the point (^Oi^t-i)- Furthermore, the 
functions Mt(#o) an d dt(9o) are defined as: 
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{ 



u t (0o) = b(6 , x t -i) - Ae a x t - X 
d t (9 ) = h(8 Q ,Xt ) - Cg Xt 



In this paper, we assume that the vector of parameters 9q is not perfectly 
known such that the inference of the hidden state x t conditionnally to y±..t is 
made with errors of specification. This typical case is frequent in practice since 
in general the vector of parameters is unknown and need to be estimated by an 
ordinary method. The resulting estimator can be biased and consequently this 
bias is propagated on the estimation of the hidden state. More precisely, if we 
denote by 9 a biased estimator of 9o such that Eg [9] = 9 = 9q + e where e is 
a fixed and unknown error corresponding to the bias, we want to evaluate the 
propagation of the error a posteriori and of the residues a posteriori given by: 



Many papers concerned the propagation of the initial error on the state 
(xq — xo) through the filter, and, to the best of our knowledge, there don't ex- 
ist in the literature, an analysis of the propagation of the initial errors on the 
vector of parameters. In this paper, we derive an expression of these propaga- 
tions for the Kalman and the Extended Kalman filters. Our main result shows 
that a correlation between the error a posteriori e t and the unobserved state Xt 
appeared at each time t of the filter. The Kalman filter is now a biased estima- 
tor and a new Lyapunov dynamic equation for the variance matrix P t is induced. 

Applications of this result include epidemiology, meteorology, neuroscience, 
ecology (see [IBAK11]) and finance (see [JPS09]). For example, our result can 
be applied to the five ecological state space models described in [PHH10]. Al- 
though the scope of our method is general, we have chosen to focus on the 
so-called autoregressive process AR(1) with measurement noise which has been 
widely studied and on which our main result can be easily applied and under- 
stood. A full illustration of this result is given for a more complex model as 
the Heston model which is very used in finance for pricing options and hedging 
portfolios (see [E1K12]). 

The paper is organized as follows. Section 2 presents the model assumptions 
and states all of the theoretical results. The application is given and discussed 
in Section 3. Some concluding remarks are provided in the last section. The 
proofs are gathered in Appendix 4. 



e t =x t - Eg[x t \yi:t] 
£i = y t -E g [y t \yi:t}- 



(3) 
(4) 
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2 Main result 



2.1 General setting and assumptions 

In this section, we introduce some preliminary main notations and provide the 
assumptions of model (2). 

2.1.1 Notations 

Subsequently, we denote by E t the pair of the unobservable states vectors given 

by f 6 * and by t£t the pair of the observations vectors f ^* where et and 

£t are defined in (3) respectively. Their variances matrix are denoted by £f and 
Tj\ respectively. 

Regarding the partial derivatives, for any function h, [dh/d9] is the vector 
of the partial derivatives of h w.r.t 8. 

Finally, Re denotes o@ <Jg and Qg a denotes a^a^ and are the covariances 
matrix of St and of r\t respectively. 

2.1.2 Assumptions 

We consider the state space models (2) , the following assumption ensures some 
smoothness for the functions h and b. 

(Al) The functions b and h are differentiable with respect to 9q and x. 



2.1.3 Main result 

Before running into the main theorem of this paper, let us explain some existing 
results. It is well known that if the vector of parameters is exactly known, the 
error a posteriori et is given by the following formula: 

et = (/„ x x« x - K t Ce )Ag e t -i - K t {u% Q s t + C eo a^°r] t ) + a^rjt 

where K t is called the Kalman matrix that minimizes the variance matrix P t . 
Under some assumptions on the model (2), a CLT is obtained for et as t tends to 
infinity (see [dNCdL94]). The following Theorem gives the propagation of the 
error a posteriori et and of £t for the Kalman filter and the Extended Kalman 
filter when Oq is not exactly known. In this respect, we further assume that 
assumption (Al) holds true and the usual Kalman assumptions are satisfied. 

Theorem 2.1. Consider the model (2). If e << 1, then: 



et = {In x xn, - K t Co)A g e t -i - Kt{o- e g e t + Cea^rjt) + a^t 

+£ f x (d, t) + T e x (9, t)x t -i + W x {6, t) + o(e) (5) 

with: 
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*20M) = -e ((/„.xn. - - *t^f W - Kt^utV)) (6) 

•^(M) - -e ((J„.xn. - K t Ce)^- K^Ae^j (7) 

- -< (^ ~ - *«^* - (8) 

Additionally, the propagation of £ t is equal to: 

Z t =C e e t +o-%e t + £ e y (e,t)+r y (e,t)x t + W e v (e,t)+o(e) (9) 

f)d r)f^ f)rr e 

£ e v(W = -e-QfV), r e v (0,t) = -e-^-, W y (6,t) = -e^-e t (10) 

Moreover, when the linearity assumption of the funtions b and h is not sat- 
isfied, the formulas above remain true with the notations of the EKF defined in 
Section 1. 

Proof. See Appendix (A). □ 

We note that the terms depending on e: £*(6,t), F*(0,t) and W*(0,i) (resp. 
£ y (6,t), Ty{6,t) and W y (6,t)) are the corrective terms arising from the bias of 
the parameters estimates. 

Besides, we can see in Eq.(5) that at time t, the propagation of the state error 
e t depends on e t _i but also on the true state variable Xt-i- Therefore, the vari- 
ance of the error e t depends on the variance of e t _i but also on the covariance 
between e t _i and Xt-i- 

Theorem 2.1 gives an expression of the error a posteriori e t and of the residues 
£t which can be rewritten as follows: 

e t = x t - Ee[x t \yv.t] 

= (x t - Ee [x t \yi-.t]) + (E eo [x t \yi. t } -E g [x t \yv.t}) 

v v / N ^ ' 

(1) error of estimation (2) correctives terms arising from the bias of parameters. 

Additionally, 

it = xt-MtAyi-.t] 

= (6-Ee„[6|yi:t])+ (EfloKt|»i:t]-EeKt|yi:t]) 

(1) true residues (2) correctives terms arising from the bias of parameters. 

The expression of the correctives terms (2) are given in Corollary 1. 
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Corollary 1. Let e t given in Eq.(5), the mean of the error a posteriori is given 
by: 

E flo [e t |yi :t ] = E eo [x t \yi:t] - ^e[x t \yi:t] 

- K t C s )A g + J^(0,t))Ee o [et_i|2/i:t_i] 
+EZ(9,t)+J*(9,t)E e [x t - 1 \y 1 *- 1 ] + o(e) (11) 

where £*(0,t) and are given in Eq.(6) and Eq.(7). 

Besides, let £ t given in Eq.(9), the mean of £ t is given by: 

^e a [it\vi-.t] = Ee o myi:t\-M^t\yi:t] 

= (Co + F y (e,t))EeM\y 1:t }+£ y (e,t) + F y (6,t)E e [x t \y 1:t ]+m 
where £ y (6,t) and Ty(9,t) are given in Eq.(lO). 

Corollary 1 which is just a consequence of Theorem (2.1) gives a computable 
recursive expression of the expected error Eg [e t \yi-t] ■ Given Ee [e ] one can de- 
duce all the values of this expectation for alH = 1, • • • , T. 



3 Illustration on the linear Gaussian AR(1) model: 

3.1 The model 

Let us consider the linear AR(1) model with measurement noise given by: 

( y t = x t + cr e £ t , t = l,---,T 
\ x t+ i = 4>ax t +<T v Vt+i- 

Since this model is linear and Gaussian we can apply Eq.(ll) in Corollary 1 
to recover the expectation of et when the state Xt is estimated with a biased 
vector of parameters. For this straighforward example, 9q is equal to <f>o- For 
the simulation, we take (j>o = 0.7, = 0.3 and a 2 e = 0.5. 

3.2 Numerical result: 

We run a Kalman filter by assuming that the parameter estimate <f> is biased 
and we take <f> — 0.85, that is e = 0.15. For this model, the functions b and h 
are given by: 

b(6{),x) = fox and h(9o,x) = x 

The variable A§ is equal to and Cg is equal to one. The control variables 
u t (0o) and d t (0o) are equal to zero. 

Furthermore, the functions £*(0,t),F*(O,t) are easily computable and given in 
the following lemma. 
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Lemma 1. For the linear AR(1) model, the functions £^(9, t) and J- t) are 
equal to: 

£*(0,t) = 0, andJ*(9,t) = -e(l-K t ) 

Therefore, by using Eq.(ll) of Corollary 1, the expectation E# [e* is given 
by 

Ee [e*ll/i:t] = ((! - Kt)(4> - e))E 6o [e t _i |j/ 1:t -i] - e(l - tf t )E 9 [a: t _i|j/i :t _i] + o(e) 

(14) 




cant snt a ^ Fnaoes^eps^cQm 

Figure 1: Red: True Error Eg [et|yi : j] x 100. Blue: Approximation (14) x 100. 



This example shows that the approximation (14) explains the true error 
Ey„ let \tji-j] for an easy model. The term T\.{8J) corresponds to the bias of Xt 
induced by the bias of the parameter estimate. Furthermore, we can see that 
the error between the true expectation Eg [et|2/i : t] and the approximation (14) 
corresponds to o(e). A full application is given in [E1K12]. 

The following Theorem regards the expression of the variances matrix Ej 
and of E t and (St respectively. 

Theorem 3.1. The variance matrix is given by: 

1 \s x t If J 

where: 

V? = (I- K t C e )A 6 VUK{I - K t C e j + J*(P)S t -i(I - K t C e )A e 

+(I - K t Ce)AgSt 1 T' x \e)+Tl{e)P x _ 1 F':{9)+Ye[Wl{e)] 
Sf = A eo SUK Q {I - KtC e j + A 6o PU^{0) + Cov e (w^),^) 
P t x = A eo Pf_ x X eo + Q d0 
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with: 



= W£(0) + °eVt ~ K t a e e e t - K t Cga v e r, t (15) 

where and W^(9) are given in Eq.(6), Eq.(7) and Eq.(8) in 

Theorem 2.1. 

Ife « 1, thenY e [W*(9)] = Q e +K t (C e QeC' e + R 8 ) K' t andCovg (w*(6), a^m) 
is given by: 

Cove (mOUlvt) = e (^Qo *£ ~ K t (cg^ + af) Qe o-'£) 
Additionally, the variance matrix Sj is given by 




where: 

V? = CoVfCg + F*(9)S t C e + C e s' t F y %9) + P y {8)P?P y {8) + Y g [W y (8)} 
S y t = C 9o SfC' e + C Bo P'tT y \6) + Covg (w*(9), a e Bo e t ) 
py = C eo P t x C 9o +Rg 

with: 

W* v (9) = W y {fi)+o%e t (16) 
where £ y {9), J~ y {9) and W y (9) are given in Eq.(lO) Theorem 2.1. 

Ife« 1, then V e [VV£(0)] = Rg and Covg (w%(6), a^e t ^ is given by: 

Covg (W y (9),*l s t ) = -e (^^o<) 
Proof. See Appendix (B). □ 

The quantities J£(0,t) and W*(9,t) (resp. J^(0,t) and W*(8,t)) correspond to 
the correctives terms arising from the bias of the parameters and in particular 
from the correlation between ej and the true state Xt (see Eq.(5)). This corre- 
lation induces a new Lyapunov dynamic equation for the variance matrix V t x . 
For unbiased parameters estimates, these terms are dropped and a CLT is given 
in [dNCdL94]. 
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4 Concluding remarks and discussion 



In this paper we provide an expression of the propagation errors on the hidden 
state for an initial and fixed error on the vector of parameters. 
We showed that the hidden state Xt appaered in the propagation equation in- 
ducing a correlation between e t and the true state xt and most importantly a 
new Lyapunov dynamic equation for the variance matrix. By using the same 
assumptions than in [dNCdL94] and adding smoothness assumptions on the 
functions b and h and on their derivatives, one can again obtain a CLT for e t - 
Nevertheless, it is not the subject of this paper. 

Another remark concerns the case where e is not fixed and is supposed to be a 
random variable. This particular case refers to the approach proposed in [HK01] 
for which the parameters are supposed time varying. A dynamical artificial evo- 
lution is assumed for such that 9 t = Qt-\ + cr z Z where Z is a centered and 
standard gaussian random variable. To the best of our knowledge, there does 
not exist results about the convergence of this approach. This method fails in 
practice when the variance a z is not small. Some authors use a z decreasing 
with time. Hence, at each step of the filter, a small perturbation is added to 
the parameters. This can be seen as a small bias e introduced at the first step 
of the filter. 
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A Proof of Theorem 2.1: 



The proof is essentially based on a one order Taylor expansion of the functions b 
and h with respect to 9. 

e t = x t -E e [x t \yi-.t] = x t - ^e[xt\yi:t-i] - K t (y t - yi) 

= u t {9 ) + Ae x t -i + <r r g Q rit - E e [u t (9) + A e x t -i + a^r) t \yi:t-i\ - K t {y t 

= u t (6o) + Ag x t -i + a^r/t - Eg[u t (9) + A e x t -i\yi:t-i] - K t (y t - i)t) 

= u t (6o) + Ae a x t -i + Vo Vt ~ u t {9) - AeEe[xt-i\yi:t-i] - K t (y t - yi) 

= u t (6o) + A 9o x t -i + <Jg o rit - u t {9) - A g x t -i - K t (y t - &~) 

Note that one can write: 

n t (9 )=u t (e)-e^(9) + o(e), Ae =A e -e^+o(e) y = a\ - e^f + o(e) 

Pluging into (17), one gets: 

e t = A e e t -, - e^(0) - e^*t-i + <r> - e^ Vt - K t {y t - fo") + o{e) (18) 
Furthermore, 



(yt-Vt) = d t {9o) + Ce x t +(7e et-'Ee[yt\yi:t-i] 

= d t (9 ) + Cg x t + al e t - d t {6) - C fl E e [a;t|j/i :t _i] 

and 

dt (9 ) = d t (8) - e^(9) + o(e), C(0 ) = C{9) - e^(6) + o(e), a% a = o% Q - e M + o(e) 
So that: 



[yt-yV) = -e^(e) + (al-e^-)e t + (Ce-e^-)(u t (eo)+Ae x t - 1+ al Vt ) 



Rewrite, 
we get: 



89 ' v " 89 
-C e Ee[xt\yv.t-i] +o(e) 

E e [x t \y 1: t-i] = A e E e [x t -i\yi:t-i] + u t {6) 

= CgAgx t -\ - C'eA g xt-i + a%E t + C'evgrjt 

e [WW + CB ^9 Xt -' + -89 £t + Ce ^e (6) + G *-M* 

(dCgdut la , dCgdAg dCg da^j \ 

{-ww (e) + -w^- 1 + -w-aoV + o(e) (19) 



+e 2 



Define, 
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we obtain: 



£t = 2/t - E 9 [t/ t |j/i :t _i] 

= CeAeet-i + a%et + Ce^Vt + £' v - (6>) + J*- {0)x t -i + W y - (9) + o(e) 

By combining Eq.(18) and Eq.(19), we have: 
e t = x t - Ee[x t \yi:t] 

= (/»,», - K t C e )A e et-i - K t a £ e e t - K t C e a v eVt + afc + £1(0) + H{e)x t -i + W e x (9) + o(e) 



where, 




One can deduce the Propagation of the residues a posteriori: 



^ = Vt -E e [y t |j/i:t] 



= d t (0o) + Ce x t + c-g e t - Eg[d t (6) + C g x t + age t \yi-.t] 



= d t (6o) - d t {6) + {C e - e-f£)x t - C e Ee{x t \yi:t] + (al - e-^-)s t + o(e) 



. *, t . i .,-.(5(.) + f, t M.,) t .( ! ) 



By defining: 




Eq.(9) follows. 



□ 



The proof of Corollary 1 is obtained by taking the expectations in Eq.(5) and 



Eq.(9). 
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B Proof of Theorem 3.1: 

By using the system (2) and Eq.(5)-Eq.(9) we can rewrite the model as follows: 

M5 ( »?>M ( '-r* M * «»» 



and, 



Hence, the variance matrix E£ is given by: 



(I-K t Ce)A e T\ 
A e . 



f(I-K t C e )A e J*{0)\ ( V[W£(0)] Co«(W| (0), 



Additionally, the variance matrix Ef is given by: 



Co Fl{e)\ (C e ?i(0)\',( V\Wi(0)] Cov(W*(6),al oVt ) 



- n r< ^ 



Proposition 2.1 gives that: 



a e a e 



Hence, if e << 1, then 

V[W£(0)] = Qe + Jft (C 9 Q 9 C; + Jfc) and V[W*(0)] = ii e 
Furthermore, the covariances are given by: 



/ - \ ( da e \ ( da e \ , 

Covg \VV^(e),a^ o r]tJ = -(.Cove l-^-rjt^^Vt j +e [K t C e -^-Vt,^ Vt J + (Cove {Kt<T%r)t,<Te T )t) 

+ eCove (^K t ^^£ t ,(T% o r)t^ 



do-g i dag , I 

e ~m~Q e ° a <>o + eKtCf >~aQ L Q ( >o cr e V a + £ - K * o e < 30o fT 0o by assum P tion A2 
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Additionally, 



Cov g (w f y (9),al £ t ) = -eCov e (j^*, <r% e t 

— He cre„ 



86 



□ 
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